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^ ! ABSTRACT 

Context. We study the bright Seyfert 1 galaxy Mrk 509 with the Reflection Grating Spectrometers (RGS) of XMM-Newton using the 
RGS multi-pointing mode of XMM-Newton for the first time in order to constrain the properties of the outflow in this object. 
Aims. We want to obtain the most accurate spectral properties from the 600 ks spectrum of Mrk 509 which has excellent statistical 



o 



^ ' quality. 

' Methods. We derive an accurate relative calibration for the effective area of the RGS, derive an accurate absolute wavelength cal- 

ibration, improve the method for adding time-dependent spectra and enhance the efficiency of the spectral fitting by two orders of 
magnitude. 
^ ' Results. We show the major improvement of the spectral data quality due to the use of the new RGS multi-pointing mode of XMM- 

r~^ ' Newton. We illustrate the gain in accuracy by showing that with the improved wavelength calibration the two velocity troughs observed 

1/-N I in UV spectra are resolved. 

)si, ', Key words. Galaxies; active - quasars: absorption lines - X-rays: general 

^—r 1. Introduction nents are present, how large is the turbulent line broadening, 

. etc. Our campaign on Mrk 509 is centred around ten obser- 

. . ^ Outflows from Active Galactic Nuclei (AGN) play an important yations with XMM-Newton spanning seven weeks in the Fall 

>• . role in the evolution of the super-massive black holes (SMBH) of 2009. The properties of the outflow are derived from the 

at the centres of the AGN, as well as on the evolution of the high-resolution X-ra y spectra taken w ith the Reflection Grating 



X 



H 



host galaxies and their suiToundings. In order to better under- Spectrometers rRGS. Iden Herder et al.i 20011 of XMM-Newton. 

J^ stand the role of photo-ionised outflows, the geometry must be The time-averaged RGS spectrum is one of the best spectra ever 

determined. In particular, our goal is to determine the distance taken by this instrument, and the statistical quality of this spec- 

of the photo-ionised gas to the SMBH, which currentiy has large trum can be used to improve the current accuracy of the cali- 

uncertainties. For this reason we have started a large monitor- bration and analysis tools. This gives rise to challenges for the 

ing campaign on one of the brig htest AGN wit h an outflow, the analysis of the data. The methods developed here also apply to 

Seyfert 1 galaxy Mrk 509 ( Kaastra et al.||2011|). The main goal other time-variable sources. Therefore we describe fliem in some 

of this campaign is to track the response of the photo-ionised gas detail in this paper, 
to the temporal variations of the ionising X-ray and UV contin- 
uum. The response time immediately yields the recombination 
time scale and hence the density of the gas. Combining this with 

the ionisation parameter of the gas, the distance of the outflow por this work we have derived a list of the strongest absorp- 

to the central SMBH can be determined. tion lines in the X-ray spectrum of Mrk 509, and we perform 

The first step in this process is to accurately determine the some simple line diagnostics on a few of the most prominent 

physical state of the outflow: what is the distribution of gas as feature s. The full time-avera ged spectrum will be presented else- 

a function of ionisation parameter, how many velocity compo- where dPetmers et alJl201l1) . 
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2. Data analysis 



Table 1. Observation log. 



Obs. 


ID 


Start date 


net exposure 


nr. 






(ks) 


1 


0601390201 


15 Oct 2009 


60 


2 


0601390301 


19 Oct 2009 


53 


3 


0601390401 


23 Oct 2009 


61 


4 


0601390501 


29 Oct 2009 


60 


5 


0601390601 


2 Nov 2009 


63 


6 


0601390701 


6 Nov 2009 


63 


7 


0601390801 


10 Nov 2009 


61 


8 


0601390901 


14 Nov 2009 


60 


9 


0601391001 


18 Nov 2009 


65 


10 


0601391101 


20 Nov 2009 


63 



Table [T] gives some details on our observations. We quote 
here only exposure times for RGS. The total net exposure time 
is 608 ks. No filtering due to enhanced background radiation was 
needed for these observations, as the background was very low 
and stable for the full duration of our campaign. 

The campaign consisted of ten different observations. Each 
observation was pointed a bit differently to obtain slightly differ- 
ent positions of the spectra on the detectors (the Multi-Pointing 
Mode of RGS; steps of 0, ±15" and ±30"). In this way holes in 
the detector, due to bad CCD columns and pixels and CCD gaps, 
changed position over the spectra, allowing all spectral bins to be 
sampled at some time. In addition this procedure will limit out- 
liers in individual wavelength bins, due to isolated noisy CCD 
pixels, since the noise of these pixels will be spread over more 
wavelength bins. 

After all exposures were separately processed using the stan- 
dard RGS pipeline of the XMM-Newton data analysis system 
SAS version 9, noise due to remaining noisy CCD columns and 
pixels was decreased further using the following process. Two- 
dimensional plots of the spectral image (cross-dispersion posi- 
tion versus dispersion angle) and the CCD-puIse height versus 
spectral dispersion were plotted in the detector reference frame 
for each separate CCD node, but with all exposures combined. 
Since spectral features will be smeared out in these plots, possi- 
ble CCD defects like hot or dead columns and hot or dead pix- 
els are clearly visible. In this way a number of additional bad 
columns and pixels were manually identified for each detector 
CCD node and added to the SAS bad pixel calibration file. After 
this, all exposures were reprocessed, using the new bad pixel ta- 
bles. 



3. Combination of spectra 

To obtain an accurate relative effective area and absolute wave- 
length calibration, we need the signal-to-noise of the combined 
600 ks spectrum. Therefore we first describe how to accurately 
combine the individual spectra, and discuss the effective area and 
wavelength scale later. 

3.1. Combination of spectra from fhe same RGS and 
spectral order 

Detectors like the RGS yield spectra with lacking counts in parts 
of the spectrum due to the presence of hot pixels, bad columns 
or gaps between CCDs. This gives rise to problems when spectra 



taken at different times have to be combined into a single spec- 
trum, in particular when the source varies in intensity and the 
lacking counts fall on different wavelengths for each individual 
spectrum. For RGS this is for example the case when the multi- 
pointing mode is used like in our present Mrk 509 campaign, or 
when data of different epochs with slightly different pointings 
are combined, or if bad columns have (dis)appeared between 
different observations due to the transient nature of some bad 
pixels. 



800 
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spectrum 1 + spectrum 2 = total spectrum residuals 



o 
o 
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-200 



Fig. 1. Illustration that spectra with different flux and different 
missing bins cannot be simply added. Both spectrum 1 and 2 
have 1 s exposure but have different constant expected count 
rates of 400 and 200 counts s"'. In spectrum 1 pixel A is miss- 
ing, in spectrum 2 pixel B. The thick solid line in the total spec- 
trum indicates the predicted number of counts by straightfor- 
wardly adding the response matrices for both spectra with equal 
weights. 



Under the above conditions the SAS task rgscombine should 
not be used. We illustrate this with a simplified example (Fig.lTJ. 
Consider two spectra of the same X-ray source, labelled 1 and 2, 
with 7 spectral bins each (Fig.[T]i- The source has a flat spectrum 
and the flux per bin in spectrum 1 is twice the flux of spectrum 
2. In spectrum 1, bin A is missing (zero counts), and in spectrum 
2, bin B is missing. The combined spectrum has 600 counts, 
except for bins A and B with 200 and 400 counts, respectively. 
However, when the response matrices of both observations are 
combined, it is assumed that bins A and B were on for 50% of 
the time, hence the model predicts 300 counts for those bins. 
Clearly, the data show a deficit or excess at those bins, which an 
observer could easily misinterpret as an additional astrophysical 
absorption or emission feature at bins A and B. 

How can this problem be solved? One possibility is to fit all 
spectra simultaneously. However, when there are many different 
spectra (like the 40 spectra obtained from 2 RGS detectors in two 
spectral orders for ten observations that we have for our Mrk 509 
data), this can be a cumbersome procedure as the memory and 
CPU requirements become very demanding. Another option is to 
rigorously discard all wavelength bins where even during short 
periods (a single observation or a part of an observation) data are 
lacking. This solution will cause a significant loss of diagnostic 
capability, as many more bins are discarded and likely several of 
those bins will be near astrophysically interesting features. 
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Here we follow a different route. It is common wisdom (and 
this is also suggested by the S AS manual) not to use fluxed spec- 
tra for fitting analysis. The problem is the lack of a well-defined 
redistribution function for that case. We will show later in Sect.|4] 
that we can obtain an appropriate response matrix for our case, 
allowing us to do analysis with fluxed spectra under the condi- 
tions given in that section. 

We first create individual fluxed spectra using the SAS task 
rgsiluxer. We use exactly the same wavelength grid for each 
observation. We then average the fluxed spectra using for each 
spectral bin the exposure times as weights. This is done for all 
bins having 100% exposure and for all spectra that are to com- 
bined. 

For bins with lacking data (like bins A and B of Fig.lTJ we 
have to follow a diff'erent approach. If the total exposure time 
of spectrum k is given by t^, and the eff'ective exposure time 
of spectral bin j in spectrum k is given by tkj, we clearly have 
tkj < h for "problematic" bins j, and we will include in the final 
spectrum only data bins with tkj > ftk, with < / < 1 a tun- 
able parameter discussed below. However, due to the variability 
of the source the average flux level of the included spectra k for 
problematic bin j may differ from the flux level of the neigh- 
bouring bins j - I and j + I that are fully exposed. To correct 
for this, we assume that the spectral shape (but not the overall 
normalisation) in the local neighbourhood of bin j is constant in 
time. From these neighbouring bins (with fluxes Fj^i and Fj+\) 
we estimate the relative contribution R to the total flux for the 
spectra k that have sufficient exposure time tkj for bin /', i.e. 




16.4 16.6 

Wavelength (A) 

Fig. 2. Spectral region near the gap between CCD 5 and 6 of 
RGS2, containing the ls-3p line of Ovm at 16.55 A from the 
outflow. From top to bottom we show the fluxed spectra (arbi- 
trarily scaled and shifted along the flux axis) for three values of 
/. Using the conservative setting / = 1, a major part of the spec- 
trum is lost. With / = no data are lost but of course between 
16.3 and 16.55 A the error bars are slightly larger due to the 
shorter exposure time in that wavelength region. 



R = 



Z tkFj^i 

k, hi>ftt 

2 tkFj^i 

k 



(1) 



and similar for the other neighbour at j -H 1 . We interpolate the 
values for R linearly at both sides to obtain a single value at the 
position of bin j. The flux for bin j is now estimated as 



Fj- 



Z t,Fj 

k, tkj>ftt 
k 



(2) 



Example: forbinA in Fig.[I]we have/? = 200x1/(400x1h-200x 
1) - 1/3, and hence using the flux measurement in spectrum 2, 
we have Fa = (200xl)/((l/3)x2) = 300 counts s^', as it should 
be. 

This procedure gives reliable results as long as the spectral 
shape does not change locally; as there is less exposure at bin 
j, the error bar on the flux will obviously be larger However, 
when there is reason to suspect that the bin is at the location of 
a spectral line that changes in equivalent width, this procedure 
cannot be applied! 

To give the user more flexibility, we have introduced the min- 
imum exposure fraction / in ([T]). For / = 0, we obtain the best 
results for spectra with constant shape (as every bit of informa- 
tion that is available is used). On the other hand, if / = 1, only 
those data bins will be included that have no bad bins in any of 
the observations to be combined. The advantage in that case is 
that there is no bias in the stacked spectrum, but a disadvantage 
is that a significant part of the spectrum may be lost, for exam- 
ple near important diagnostic lines. In particular for the multi- 
pointing mode the purpose of which is to have different wave- 
lengths fall on different part of the spectrum and thus to have a 
measurement of the spectrum at all wavelengths, this can be a 



problem. We illustrate this for the region near the chip gap be- 
tween CCD 5 and 6 of RGS2 (see Fig. |2]l. For / < 1 some data 
points have large error bars due to the low effective exposure of 
some bins containing missing columns, and the corresponding 
low number of counts A^ yielding large relative errors (~ A^""^). 
In our further analysis we use f - 0- 

Finally, in the combined spectrum due to the binning and 
randomisation procedures within the SAS task rgsiluxer, it is still 
possible that despite careful screening for bad pixels, a few bad 
bins remain in the fluxed spectrum, often adjacent or close to dis- 
carded bins. For this reason, we check for any bins that are more 
than 3cr below their right or left neighbouring bin (taking the 
statistical errors of both into account). Typically, the algorithm 
finds a few additional bad bins in an individual observation, 
which we also discard from our analysis. Only for very strong 
isolated emission lines with more than 1500 counts in a single 
observation our method would produce false rejections near the 
bending points of the instrumental spectral redistribution func- 
tion, because then the spectral changes for neighbouring bins 
are stronger than 3cr statistical fluctuations, but our spectra of 
Mrk 509 do not contain such sharp and strong features. 

We have implemented the procedures presented in this 
Section in the program rgsJuxcombine that is available within 
the publicly available SPEX distributiorQ. 

Finally, we note that in principle another route exists to com- 
bine the spectra. This would be to run rgscombine but to modify 
by a user routine the column AREASCAL so that the count rates 
are properly corrected for the missing bins. It can be shown that 
in the combined spectrum AREASCAL must be scaled by 1 /R, 
with R derived as described above, in order to obtain robust re- 
sults. It requires however the development of similar tools as 
rgsJuxcombine to do this modification. 



www.sron.nl/spex 
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3.2. Combination of spectra from different RGS or spectral 
order 

In the previous section we showed how to combine spectra from 
the same RGS and spectral order, but for different observations. 
However, we also want to combine RGSl and RGS2 spectra, 
and spectra from the first and second spectral order We do this 
by simply averaging the fluxed spectra, using for each bin the 
statistical errors on the flux as weight factors. Because of the 
lower effective area in the second spectral order, second order 
spectra get in this way less weight then first order spectra, their 
weights are typically between 20% (near 18 A) and 50% (near 
8 A) of the first-order spectral weights. 

In Sect. ID we show how to create a response matrix for the 
fluxed RGS spectra. The same weights that are used to determine 
the relative contributions of the different spectra to the combined 
spectrum are also used to weigh the contributions from the corre- 
sponding redistribution functions. To avoid discontinuities near 
the end points of second order spectra or near missing CCDs, 
we pay attention to small effective area corrections, as outlined 
in Sect. |5] 



4. Response matrix 

We have explained before that for time-variable sources standard 
SAS tasks like rgscombine cannot be used to combine the 10 in- 
dividual spectra of Mrk 509 or any other source into a single re- 
sponse matrix. For that reason, we have combined fluxed spectra 
as described in the sections above. The only other feasible alter- 
native would be to fit the 10 individual spectra simultaneously. 
But with 2 RGS detectors, 2 spectral orders and 10 observations, 
this adds up to 40 individual spectra. The memory occupied by 
the corresponding response matrices is 2.0 Gbyte. Although our 
fitting program SPEX is able to cope with this, fitting becomes 
cumbersome and error searches extremely slow, due to the large 
number of matrix multiplications that have to be performed. 

For our combined fluxed spectra, fitting is much faster be- 
cause we have only a single spectrum and hence only need one 
response matrix. As we use fluxed spectra, we can use a simple 
response matrix with unity effective area, and the spectral re- 
distribution function given by the redistribution part of the RGS 
matrix. Unfortunately, the RGS response matrix produced by the 
SAS rgsrmfgen task combines the effective area and redistribu- 
tion part into a single data file. Furthermore, due to the multi- 
pointing mode that we used and due to transient bad columns, 
the matrix for each of the 10 observations will be slightly differ- 
ent. 

Therefore, we have adopted the following approach. For a 
number of wavelengths (7 to 37 A, step size 2 A), we have fit- 
ted the RGS response to the sum of three or four Gaussians. 
This gives a quasi-diagonal matrix resulting in another speed- 
up of spectral analysis. For these fits to the redistribution func- 
tion, we have omitted the data channels with incomplete expo- 
sure (near chip boundaries, and at bad pixels). The parameters of 
the Gaussians (normalisation, centre and width) were then mod- 
elled with smooth analytical functions of wavelength. We paid 
most attention to the peak of the redistribution function, where 
most counts are found. 

The following parametrisation describes the RGS redistribu- 
tion functions well (all units are in A): 



Table 2. Parameters of the RGS redistribution function. 



RGS 


RGSl 


RGS2 


RGSl 


RGS2 


Ordei 


-1 


-1 


-2 


-2 


Ol 


0.0211 


0.0237 


0.0100 


0.0118 


02 


0.0514 


0.055 


0.018 


0.027 


03 


0.0105 


0.016 


-0.0572 


0.017 


04 


- 


- 


0.404 


0.339 


bi 


0.00028 


0.00032 


0.00031 


0.00035 


bi 


0.00039 


0.00058 


0.00075 


0.00080 


bi 


0.021 


0.020 


0.021 


0.00066 


b, 


- 


- 


-0.01 


- 


Ci 


-0.000405 


-0.000346 


-0.000692 


- 


di 


0.1068 


0.1314 


-1.1022 


0.0084 


d, 


- 


- 


1.5276 


1.2276 


d. 


- 


_ 


0.3190 


-0.1211 


ei 


0.0125 


0.0086 


0.289 


0.0153 


ea 


- 


- 


-0.223 


-0.140 


64 


- 


- 


-0.017 


0.0833 


/2 


-0.00024 


-0.00017 


-0.0215 


-0.00024 


/3 


~ 


- 


0.016 


0.0075 


/4 


- 


- 


-0.0014 


-0.0080 


g2 


- 


- 


0.00053 


-0.000009 


83 


- 


- 


-0.00044 


-0.00015 


^4 


- 


- 


0.00011 


0.00022 


o 











.« ° 



o I 




20 
wavelength (A) 

Fig. 3. Comparison of the redistribution function for RGS2, first 
order as delivered by the SAS (thin solid line) to our approxima- 
tion using three Gaussians (thick lines), for four different photon 
wavelengths. 



(Tj — Oj + biA + c,/l , 

/ = 1 : A^j = 1 _ A?2 - A^3 - A^4, 

/ -2, 1st order only : A^,- = dj + e,vl + fi?? + gjA^ , 

i -2-4, 2nd order only : A^; - di + CjA + fiA^ + giA^, 



i = 3, 1st order RGSl only : A^3 = -0.065 + 2.5 IX 
i = 3, 1st order RGS2 only : N^ = -0.075 + 2.5 1 A^ 



0.7 



0.7 



(4) 
(5) 
(6) 
(7) 
(8) 
(9) 



R{A',A)^Y^ 

i=I 



Ni 



V27r( 



-{A- A' f 120-] 



(3) 



ncTi 



The relevant parameters are given in Table |2] omitting pa- 
rameters that are not used (zero). We cut-off the redistribution 
function beyond ±1 A from the line centre. We have com- 
pared these approximations to the true redistribution function 
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-16 

wavelength (A) 

Fig. 4. As Fig. [3] but only for photons at 18 A. 

(see Figs.[3]-0|i, and find that our model accurately describes the 
core down to a level of about 1 % of the peak of the redistribu- 
tion function. Also, the flux outside the + 1 A band is less than 
~ 1 % of the total flux for all wavelengths. It should be noted that 
the above approximation is more accurate than the calibration of 
the redistribution itself. Currently, the width of the redistribution 
is known only to about 10% accuracjQ. 

We finally remark that Fig. [3] seems to suggest that the re- 
sponse also contains the second order spectra, for instance for 
9 A photons, apart from the main peak near 9 A, there are sec- 
ondary peaks at 18, 27 and 36 A. These peaks are, however, not 
the genuine higher order spectra, but they correspond to higher 
order photons that by chance end up in the low-energy tail of 
the CCD redistribution function. For instance, a photon with in- 
trinsic energy 1.38 keV would be seen in first order at 9 A, but 
in second order at 18 A. A small fraction of these second order 
events end up in the CCD redistribution tail and are not detected 
near 1.38 keV but near 0.69 keV. Because their (second order) 
measured wavelength is 18 A, they are identified as first order 
events (energy 0.69 keV, wavelength 18 A). As can be seen from 
Fig. [3] this is only a small contribution, and we safely can ignore 
them. 



5. Effective area corrections 

The statistics of our stacked 600 ks spectrum of Mrk 509 is ex- 
cellent, with statistical uncertainties down to 2% per 0.1 A per 
RGS in first order In larger bins of 1 A wide, the quality is 
even better by a factor of ~ 3. These statistical uncertainties are 
smaller than the accuracy to which the RGS effective area has 
been calibrated. A close inspection of the first and second order 
spectra of our source for both RGS detectors showed that there 
are some systematic flux differences of up to a few percent at 
wavelength scales larger than a few tenths of an A (Figs. [S}®, 
consistent with the known accuracy of the effective area calibra- 
tion. These differences sometimes correlate with chip boundaries 
(like at 34.5 A) but not always. A preliminary analysis shows the 
same trends for stacked RGS spectra of the blazar Mrk 42 1 . 



10 11 



^ See also i Kaastra et all ( l2006h . Table 2 where equivalent widths 
based on different representations of the redistribution function are 
compared, and also a comparison to Chandra LETGS data is made. 
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Fig. 5. Flux ratio between RGS 1 and RGS2 for parts of the spec- 
trum that are detected by both RGS. Different symbols indicate 
different combinations of CCD chips for both RGS detectors. 
Open symbols: RGSl chips 1, 3, 5, 9; filled symbols: RGSl 
chips 2, 4, 6, 8. Triangles: RGS2 chips 1, 3, 5, 7, 9; circles: 
RGS2 chips 2, 6, 8. The solid line is a simple fit to the ratio's 
using a constant plus Gaussians. 



While a full analysis is underway, we use here a simple 
approach to correct these differences and obtain an accurate 
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Wavelength (A) 

Fig. 6. As Fig.|5] but for second order spectra compared to first 
order spectra. Due to the specific geometry of missing CCDs 
(CCD7 for RGSl, CCD4 for RGS2) we compai-e all fluxes to 
the first order of RGS2. Open symbols: second order chips 1, 3, 
5, 7, 9; filled symbols: second order chips 2, 4, 6, 8. Triangles: 
first order chips 1, 3, 5, 7, 9; circles: first order chips 2, 6, 8. 



relative eff'ective area calibration. We model the differences 
purely empirical with the sum of a constant function and a few 
Gaussians with different centroids, widths and amplitudes, as 
indicated in Figs. |5H6] We then attribute half of the deviations 
to RGSl and the other half to RGS2. In this way there is bet- 
ter agreement between the diff'erent RGS spectra, in particu- 
lar across spectral regions where due to the failure of one of 
the chips the flux would otherwise show a discontinuity at the 
boundary between a region with two chips and a region with 
only one chip. As the effective area of the second order spectra 
has been calibrated with less accuracy than the first order spec- 
tra, we adjust the second order spectrum to match the first order 
spectrum. 

We obtain the following corrections factors: 

rii(i)= 1.02 + G(^ -0.108,5,2) 
+ G(^, 0.099, 7.5, 0.6) 
+ GU, 0.055, 17.4, 0.6) 
+ G(i, 0.069, 24.4, 1.3) 
+ G(/l, -0.042, 27.3, 0.7) 
+ G(/l, 0.037, 31.3, 1.8), (10) 
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Fig. 7. Adopted eff'ective area corrections for RGS 1 (solid lines) 
and RGS2 (dashed fines), in first and second spectral order. 



ri2(i) = 0.997 H- G(^, 0.103, 11.9, 1.4) 

H- G(^, 0.085, 18.1,2.2), 

r22(i) = 0.995 H- G(^, -0.882, 5.0, 0.7) 

H- G(/l, -0.063, 15.2, 0.9), 



(11) 
(12) 



Here r,y is the flux of RGS / order j relative to RGS2 order 1 as 

shown in Figs. EHU and G(/l, M yu, cr) = f^^-i^- ^i?- I^cr'^ , with 
these definitions, the fluxes need to be multiplied by correction 
factors fl,j to get the corrected values as follows: 



flu = (l + l/r„)/2 

ai\ = (1 +'"ii)/2 

fli2 = (1 +rii)/2ri2 

an = (1 +rii)/2r22. 



(13) 
(14) 
(15) 
(16) 



The above approach may fail whenever the small discrep- 
ancy between one spectrum and the other is caused by a single 
RGS, and not by both. Then there will be a systematic error in 
the derived flux with the same Gaussian shape as used for the 
correction factors, but with half its amplitude. This is compen- 
sated largely, however, by our approach where we fit the true un- 
derlying continuum of Mrk 509 by a spline, and in our analysis 
we make sure that we do not attribute real broad lines to potential 
effective area corrections. The adopted effective area corrections 
(the inverse of the a,y factors) are shown in Fig. |7] Note that for 
wavelength ranges with lacking CCDs the effective area correc- 
tions are based on the formal continuation of the above equa- 
tions, and because of that may differ from unity. However, this is 
still consistent with the absolute accuracy of the RGS effective 
area calibration. 



6. Wavelength scale 

6. 7. Aligning botti RGS detectors 

The zero-point of the RGS wavelength scale has a systematic 
uncertainty of about 8 mA ( Iden Herder et al.|[2001h . According 
to the latest insights, this is essentially due to a slight tilting 
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Fig. 8. Wavelength difference RGS2 - RGSl for the 9 strongest 
Unes. The dashed Une indicates the weighted average of 6.9 ± 
0.7 mA. 



a way that the equivalent width is a free parameter of the model 
and not a derived quantity based on the measured continuum or 
line peak: 

F(A) = c(l + b(A - Ao) + a(A - Aof - -I^e-'-^-^^o^'/^'Axn) 

As we focus in this paper entirely on absorption lines, we re- 
port here EWs for absorption lines as positive numbers. Where 
needed we added additional Gaussians to model the other strong 
but narrow absorption or emission lines that are present in 
the same 1 A wide intervals around the line of interest. The 
Gaussians include the instrumental broadening. For the strongest 
14 lines we could leave the width of the Gaussians a free param- 
eter; for the others we constrained the width to a smooth inter- 
polation of the width from these stronger lines. 

For illustration purposes, we show the full fluxed spectrum 
(after all corrections that we derived in this paper have ben 
applied) in Fig. |9] A more detailed view of the spectrum in- 
cluding realistic spec tral fits is given in paper III of this series 
dPetmers et alj|201 Ih . 



of the grating box due to Solar irradiance on one side of it, 
which causes small temperature differences and hence inhomo- 
geneous thermal expansion. The thermal expansion changes the 
incidence angle of the radiation on the gratings and hence the 
apparent wavelength. The details of this effect are currently un- 
der investigation. Due to the varying angle of the satellite with 
respect to the Sun during an orbit and the finite thermal conduc- 
tivity time scale of the relevant parts, the precise correction will 
depend also on the history of the satellite orbit and pointing. 

As corrections for this effect are still under investigation, we 
take here an empirical approach. We start with aligning both 
RGS detectors. We have taken the 9 strongest spectral lines in 
our Mrk 509 spectrum that can be detected on both RGS de- 
tectors, and have determined the difference between the average 
measured wavelength for both RGS detectors in the combined 
spectrum. We found that the wavelengths measured with RGS2 
are 6.9 + 0.7 mA larger than the wavelengths measured with 
RGSl (Fig.O. 

Therefore, we have re-created the RGS2 spectra by sampling 
them on a wavelength grid similar to the RGS 1 wavelength grid, 
but shifted by +6.9 mA. Using this procedure, spectral lines al- 
ways end up in the same spectral bins. This allows us to stack 
RGSl and RGS2 spectra by simply adding up the counts for 
each spectral bin, without a further need to re-bin the counts 
when combining both spectra. 

After applying this correction, we stacked the spectra of both 
RGS detectors and compared the first and second order spectra. 
We found that the second order centroids agree well with the first 
order centroids (to within an accuracy of 3 mA) and therefore did 
not apply a correction. 

6.2. Strongest absorption lines 

To facilitate the derivation of the proper wavelength scale, we 
have compiled a list of the strongest spectral lines that are 
present in the spectrum of Mrk 509 (Table|3]l. We have measured 
the centroids Aq and equivalent widths EW of these lines by fit- 
ting the fluxed spectra F{A) locally in a 1 A wide band around 
the line centre using Gaussian absorption lines, superimposed on 
a smooth continuum for which we adopted a quadratic function 
of wavelength. We took care to re-formulate this model in such 



6.3. Doppier stiifts and velocity scale 

The spectra as delivered by the SAS are not corrected for any 
Doppier shifts. We have estimated that the time-averaged veloc- 
ity of the Earth with respect to Mrk 509 is -29.3 km s"' for the 
present Mrk 509 data. Because the angle between the lines of 
sight towards the Sun and Mrk 509 is close to 90° during all 
our observations, the differences in Doppier shift between the 
individual observations are always less than 1.0 kms"', with an 
r.m.s. variation of 0.5 km s"' . Therefore these differences can be 
ignored (less than 0.1 mA for all lines). The orbital velocity of 
XMM-Newton with respect to Earth is less than 1.2 kms"' for 
all our observations and can also be neglected. 

We need a reference frame to determine the outflow veloc- 
ities of the AGN. For the referenc e cosmological red shift we 
choose here 0.034397 + 0.000040 (iHuchraet al.lll993h . as this 
value is also recommended by the NASA/IPAC Extragalactic 
datab ase NED, although th e precise origin of this number is un- 
clear; iHuchra et al.l (Il993h refer to a private communication to 
Huchra et al. in 1988. 

Because we do not correct the RGS spectra for the orbital 
motion of the Earth around the Sun, we will instead use an ef- 
fective redshift of 0.03450+0.00004 to calculate predicted wave- 
lengths. This effective redshift is simply the combination of the 
0.034397 cosmological value with the average -29.3 kms"' of 
the motion of the Earth away from Mrk 509. 

For Galactic foreground lines it is common practice to use 
velocities in the Local Standard of Rest (LSR) frame, and we 
will adhere to that convention for Galactic lines. The conver- 
sion from LSR to Heliocentric veloc ities for Mrk 509 is given 
by vlsr - VHeiio + 8.9 kms', cf. iSembach et all d 19991) . 

6.4. Absolute wavelength scale 

The correction that we derived in Sect. |6T| aligns both RGS de- 
tectors and spectral orders, but there is still an uncertainty (wave- 
length offset) in the absolute wavelength scale of RGS 1 . We es- 
timate here this offset by comparing measured line centroids in 
our spectrum to predicted wavelengths based on other X-ray, UV 
or optical spectroscopy of Mrk 509. We consider here five dif- 
ferent methods: 

1 . Lines from the outflow of Mrk 509 
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Fig. 9. Fluxed, stacked spectrum of Mrk 509. Only the lines used in Table |3]have been indicated. See iDetmers et al.l (1201 lb for a 
more detailed figure. 



2. Lines from foreground neutral gas absorption 

3. Lines from foreground hot gas absorption 

4. Comparison with a sample of RGS spectra of stars 

5. Comparison with Chandra LETGS spectra 



6.4.1 . Lines from the outflow of Mrk 509 

The outflow of Mrk 509 has been studied in th e past using h igh- 
resolution UV spectroscopy with FUSE ( Ki'iss et al.ll2000h and 
HST/STIS (iKraemeretal'l 120031) . A preliminary study of the 
COS spectra taken simultaneously with the Chandra LETGS 
spectra in December 2009, within a month from the present 
XMM-Newton observations, shows that there are no major 
changes in the structure of the outflow as seen through the 
UV lines. Hence, we take the archival FUSE an d STIS data 
as templates for the X-ray lines. iKriss et all (l2000h showed the 
presence of seven velocity components labelled 1-7 from high 
to low outflow velo city. Component 4 may have sub-structure 
dKraemer et al.l2003[) but we ignore here these small differences. 



The outflow components form two distinct troughs, a high veloc- 
ity component from the overlapping components 1-3 and a low 
velocity component from the overlapping components 4-7. 

Note that our a nalysis of ISM lin es in the HST/COS spectra 
for our campaign (Kriss et al. '2011") shows that the wavelength 
scale for the FUSE data (Kriss et al. 2000) needs a slight adjust- 
ment. As a cons equence, we subtract 26 kms"' from the veloc- 
ities reported bv lKriss et al.l (1200 0). in addition to the correction 
to a different host galaxy redshift scale. 

In Table |4] we list the column density weighted or equiva- 
lent width weighted average velocities. We also give the typical 
ionisation parameter ^ for which the ion has its peak concentra- 
tion, based upon the analysis of archival XMM-Newton data of 
Mrk 509 by Detmers et al. (2010 ). 

In principle, equivalent-width weighted line centroids are 
preferred to predict the centroids of the unresolved X-ray lines. 
As for most of the X-ray lines the optical depths are not very 
large, using just the column density weighted average will not 
make much difference, however. 
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Table 3. Strongest absorption lines observed in the spectrum of Mrk 509. 



Ion 



Fexxi 

Fexix 

Fe xvm 

Mgxi 

Nex 

Neix 

Neix 

Nevni 

Nevii 

Oviii 

Ovm 

Ovm 

Ovm 

Ovii 

Ovii 

Ovii 

Ovii 

Ovi 

Ov 

Nvii 

Nvi 

Nv 

Cvi 

Cvi 

Cvi 

Cv 



Fe XVII G 

NexG 

NeixG 

OviiiG 

OviiG 

OviiG 

OviG 

CviG 



OiG 

NiG 



Trans- 
ition 



(mA) 



A 



'llab'' 

A 



V 

(kms-') 



Reference*^ 



2p-3d 3.0 ±0.8 12.664 ±0.008 12.2840 ± 0.0020 -990 ±200 Brown et al. (2002) 

2p-3d 8.5 ±1.2 13.979 ±0.005 13.516 ±0.005 -20 ±150 Phillios et al. (1999) 

2p-3d 5.6 ±0.8 14.680 ±0.005 14.207 ±0.010 -330 ± 230 Phillips et al. (1999) 

s-2p 3.6 ±0.7 9.465 ±0.005 9.1688 ± 0.0000 -590 ±150 Flembers (1942) 

s-2p 9.8 ±0.9 12.539 ±0.002 12.1346 ± 0.0000 -300 ±40 Erickson (1977) 

s-3p 2.5 ±0.9 11.921 ±0.009 11.5440 ± 0.0020 -480 ± 240 Peacock et al. (1969) 

s-2p 13.2 ±1.1 13.897 ±0.003 13.4470 ± 0.0020 -260 ±70 Peacock et al. (1969) 

s-2p 5.6 ±0.8 14.118 ±0.006 13.654 ±0.005 -100 ±160 Peacock et al. (1969) 

s-2p 4.0 ±0.8 14.282 ±0.006 13.8140 ± 0.0010 -130 ±130 Behar & Netzer (2002) 

s-5p 2.4 ±0.9 15.332 ±0.010 14.8205 ± 0.0000 +30 ±190 ' Erickson (1977) 

s^p 5.2 ±0.8 15.692 ±0.006 15.1762 ± 0.0000 -120 ±110 Erickson (1977) 

s-3p 13.4±1.2 16.546±0.002 16.0059 ± 0.0000 -190±40 Erickson (1977) 

s-2p 31.8±1.4 19.615±0.001 18.9689 ± 0.0000 -110±20 Erickson (1977) 

s-5p 2.7 ±0.9 18.022 ±0.013 17.3960 ± 0.0020 +460 ± 230 Engstrom & Litzen (1995) 

s^p 10.3 ±1.3 18.382 ±0.004 17.7683 ± 0.0007 +40+60 Engstrom & Litzen (1995) 

s-3p 14.8 ±1.0 19.265 ±0.003 18.6284 ± 0.0004 -70 ± 50 Engstrom & Litzen (1995) 

s-2p 32.7±2.1 22.335±0.002 21.6019 ± 0.0003 -140±20 Engstrom & Litzen (1995) 

s-2p 21.2 ±2.3 22.771 ±0.005 22.0194 ± 0.0016 -90 ±70 Schmidt et al. (2004) 

s-2p 15.8 ±2.9 23.120 ±0.005 22.3700 ± 0.0100 -260 ±140 Gu et al. (2005) 

s-2p 19.0 ±2.3 25.625 ±0.004 24.7810 ± 0.0000 -110 ±50 Erickson (1977) 

s-2p 23.8 ±2.8 29.773 ±0.004 28.7875 ± 0.0002 -70 ±40 Engstrom & Litzen (1995) 

s-2p 7.6 ±2.3 30.402 ±0.014 29.414 ± 0.004 -240 ± 150 Beiersdorfer et al. (1999) 

s^p 5.7 ±1.9 27.933 ±0.015 26.9898 ± 0.0000 +150+160 Erickson (1977) 

s-3p 16.2 ±2.8 29.438 ±0.007 28.4656 ± 0.0000 -90 ± 70 Erickson (1977) 

s-2p 49.3 ±3.0 34.890 ±0.002 33.7360 ± 0.0000 -70 ± 20 Erickson (1977) 

s-3p 17.1 ±3.2 36.174 ±0.005 34.9728 ± 0.0008 -30 ±40 Edlen & Lof strand (1970) 



2p-3d 2.5 ±0.8 15.021 ±0.008 15.0140 ± 0.0010 +160+160 Brown et al. (1998) 

s-2p 1.6 + 0.8 12.125 + 0.010 12.1346 + 0.0000 -210 + 240 Erickson (1977) 

s-2p 4.2 + 0.9 13.454 + 0.006 13.4470 + 0.0020 +180+150 Peacock et al. (1969) 

s-2p 8.2+1.1 18.966 + 0.005 18.9689 + 0.0000 -30 + 80 Erickson (1977) 

s-3p 5.7+1.1 18.624 + 0.010 18.6284 + 0.0004 -70+170 Engstrom & Litzen (1995) 

s-2p 16.6 + 2.2 21.604 + 0.004 21.6019 + 0.0003 +30 + 50 Engstrom & Litzen (199^ 

s-2p 4.5 + 1.8 22.025 + 0.013 22.0194 + 0.0016 +70+180 Schmidt et al. (2004) 

s-2p 8.8 + 2.5 33.743 + 0.013 33.7360 + 0.0000 +80+170 Erickson (1977) 



s-2p 29.1 + 2.8 23.521+0.003 23.5113+0.0018 +130 + 50 Kaastra et al. (2010) 

s-2p 22.2 + 4.3 31.302 + 0.009 31.2857 + 0.0005 +160 + 80 Sant' Anna et al. (2000) 



Equivalent width 

Observed wavelength using the nominal SAS wavelength scale for RGSl, with RGS2 wavelengths shifted downwards by 6.9 mA. 

Laboratory wavelength (experimental or theoretical) with best estimate of uncertainty 

Outflow velocity; for lines from the AGN relative to the systematic redshift of 0.034397; for lines from the ISM (labelled "G" in the first 

column) to the LSR velocity scale. Observed errors and errors in the laboratory wavelength have been combined. The velocities are based on 

the observed wavelengths of column (4) but corrected for the orbital motion of the Earth, and have in addition been adjusted by adding 1.8 mA 

to the observed wavelengths (see Sect. l6.4.6t . 

References for the laboratory wavelengths 



The strong variation of average outflow velocity as a func- 
tion of ^ that is apparent in Table H] in particular between O vi 
and N v, shows that caution must be taken. Therefore, we con- 
sider here only X-ray lines from ions which overlap in ionisa- 
tion parameter with the ions in Table |4] We list those lines in 
Table |5] For C v and O v we have interpolated the velocity be- 
tween the O VI and N v velocity according to their log ^ value. 
The weighted average shift A/l = /lobs - '^pred is -5.3 ± 3.6 mA. 



6.4.2. Lines from foreground neutral gas absorption 

Our spectrum (see Table O contains lines from the neutral as 
well as the hot phase of the ISM. Here we discuss the neutral 
phase. The spectrum shows two strong lines from this phase, the 



Oi and Ni is-2p lines. Other lines are too weak to be used for 
wavelength calibration purposes. 

The sight line to Mrk 509 is dominated by neutral mate- 
rial at LSR velocities of about H-10 kms"' and +60 kms"', 
as seen for inst ance in the Nai Di and D2 lines, and the Can 
H and K lines (Yorket al. 1982) . Detailed 21 cm observations 
dMcGee & Newtonlll986h show four components: two almost 
equal components at -6 and +7 kms"', a weaker component 
(6% of the total H i column density) at +59 km s"' and the weak- 
est component at +93 kms"' containing 0.7% of the total col- 
umn density. 

The strongest X-ray absorption line from the neutral phase 
is the Oi ls-2p transition. The rest-frame wavelength of this 
line has been determined as 23.5113 + 0.0018 A (Kaastra et al 



2010). Based on the velocity components of iMcGee & Newton 



(1 19861) . we have estimated the average velocity for this line as 



10 



J.S. Kaastra et al.: Multiwavelength campaign on Mrk 509 



Table 4. Outflow velocities in km s derived from UV lines. 



Table 6. Wavelengths (A) of the O iv ls-2p transitions; all tran- 
sitions are from the ground state to ls2s^2p^. 



Feature 


logr 


Comp 


Comp 


Comp 


w^ 


Ref 
















1-3 


4-7 


1-7 






^D3/2 


^Pw. 


'Pv. 


^S,„ 


reference 


Hi .11026 




-317 


+6 


-56 


N 


FUSE 


22.777 


22.729 


22.727 


22.571 


HULLAC 


Hi ^1026 




-330 


+48 


-89 


EW 


FUSE 


22.755 


22.739 


22.736 


22.573 


Cowan code*" 


Oviil032/38 


+0.4 


-329 


+44 


-28 


N 


FUSE 


22.77 


22.74 


22.74 


22.66 


AS2 (Autostructure)'= 


Oviil032/38 




-338 


+64 


-78 


EW 


FUSE 


22.78 


22.75 


22.75 


22.65 


HFl (Cowan code)'^ 


Nv ^1239/43 


-0.1 


-344 


+21 


-166 


N 


STIS 


22.73 


22.67 


22.46 


R-matrix'' 


Civ^ 1548/51 


-0.6 


-328 


+ 1 


-197 


N 


STIS 




22.741 


+0.004 




experimenf^ 


C 111^977.02 


-1.4 


-347 


-16 


-224 


N 


FUSE 












22.777 


22.741 


22.739 


22.571 


Adopted value 


Cm ^977.02 




-339 


-27 


-231 


EW 


FUSE 


0.010 


0.004 


0.004 


0.025 


Adopted uncertainty 



" ionisation parameter in 10 ' Wm 

*" N: weighted average using column densities; EW: weighted average 



equi\ 
: lKri 



FUSE: iKriss et al. t2000h . STIS : iKraemer et all ( l2003h 



E. Behar 2001, private communication 
A.J.J. Raassen 2001, private communication 
Garcia et al.l J2005l) 



Pradhan et all i 2003h 
Guetal.1 ( 12005) 



Table 5. Wavelength differences (observed minus predicted, in 
mA) for selected X-ray lines of the outflow. 



Line 


logr 


v^ 


Av*^ 


^obs ^pred 






(kms-') 


(kms-') 


mA 


N VI ^28.79 


+0.7 


-28 


50 


-5 ±7 


VI ^22.02 


+0.4 


-28 


10 


-6 ±5 


Cv/i34.97 


+0.1 


-110 


100 


+8± 13 


Ov ^22.37 


0.0 


-138 


40 


-11±12 


Nvi29.41 


-0.1 


-166 


10 


-10±15 


average 








-5.3 ±3.6 



ionisation parameter in lO"' Wm 
adopted velocity based on UV lines 

adopted systematic uncertainty in velocity based on the differences 
in centroids for the UV lines 

statistical uncertainty observed line centroid, uncertainty in labora- 
tory wavelength (Table O and the adopted systematic uncertainty 
in velocity have been added in quadrature 



ment and the HULLAC calculations, while the adopted uncer- 
tainties are based on the scatter in the wavelength differences. 

In the present case, both the O IV ls-2p^D3/2 and ls-2p-Pi/2 
and ^P3/2 transitions of the outflow blend with the Galactic O i 
line, with relative contributions of 37, 42 and 21%, respectively. 
The precise contamination is hard to determine, as these are the 
strongest O iv transitions and the other O iv transitions are too 
weak to determine the column density. Based on the absorption 
measure distribution of other ions with similar ionisation param- 
eter, we estimate that the total equivalent width of these three 
O IV lines is 3.6 + 2.0 mA dPetmers et al.ll20111) . The same mod- 
elling predicts that the O iv concentration peaks at log ^ = -0.6, 
at almost the same ionisation parameter as for C iv. The opti- 
cal depths of all X-ray lines of Oiv is less than 0.1, hence we 
use the column density weighted average line centroid of C iv 
(-197 kms ') for Oiv. With this, we predict a wavelength for 
the combined ^D and ^P ti'ansitions of O iv of 23 .523 + 0.004 A. 



well as the UV line at 1302 A, as suming an oxy gen abundance 
of 5.75 X 10 '^ (proto-Solar value: lLoddersll2003h . adopting that 
~ 50% of all neutral oxygen is in its atomic form. We predict 
for the A23.5 A line an LSR velocity of +24 kms ', and for 
the /11302 A line a velocity of +40 kms"'. For a two times 
higher column density, these lines shift by no more than +6 and 
+ 1 kms"\ respectively. For comparison, the measured centroid 
of the /11302 A line ( ColUns et a l."2004'. Fig. 6) is +40 kms"', 
in close agreement with the prediction. A systematic uncertainty 
of +10 kms ' on the predicted A23.5 A line seems therefore to 
be justified. Taking this into account, we expect the O i line at 
23.515 ±0.002 A. 

Unfortunately, the Galactic O i A23.5 A line is blended by ab- 
sorption from lines of two other ions in the outflow of Mrk 509, 
O IV and Ca xv. We discuss those lines below. 

Contamination by O iv: The O iv ls-2p ^P transition, which has 
a laboratory wavelength of 22.741 ± 0.004 A dGu et al.ll2005l) . 
contaminates the Galactic O i line. In fact, the situation is more 
complex, as the main Oiv ls-2p multiplet has four transitions. 
In Table |6] we summarise the theoretical calculations and exper- 
imental measurements of the wavelengths of these lines. The 
adopted values are based on the single experimental measure- 



Contamination by Ca xv: The other contaminant to the Galactic 
Oi ls-2p line is the Caxv 2p-3d (^o - ^Dj) ^22.730 A transi- 
tion of the outflow. This is the strongest X-ray absorption line 
for this ion. Based on the measured column density of Ca xiv in 
the outflow, and assuming that the Ca xv column density is sim- 
ilar, we expect an equivalent width of 4.9 ± 2.3 mA for this line, 
about 10 % of the strength of the O i blend. 

The rest-wavelength of this transition is somewhat uncer- 
tain, however. iKellvl (Il987|) gives a value of 22.72 5 A, with- 
out errors, and refers to 'Broma ge & Fawcetn (Il977h who state 
that their theoretical wavelength is within 5 mA from the ob- 
served value of 22.730 A, and that the line is blended. Although 
not explicitly mentioned, their experimental value comes from 
Kellv & Palumbo (1973) (cited in the Bromage & Fawcett pa- 
per). iKeUv & Palumbol (1 19731) gives a value of 22.73 A, but 
states it is for the ^ Pa - ^D3 transition, whi ch is a misclassifi- 
cation according to Fawcett & HavesI (Il975h . They hst as their 
source iFawcettI (1197 Ih . The latter pu blication gives the value 
of 22.73, but no accuracy. In iFawcetti (197 0) the typical accu- 
racy of the plasma measurements is limited by Doppler mo- 
tions of ab out lO'^^c . If we assume that accuracy also holds for 
the data in iFawcettI (Il97lh . the estimated accuracy should be 
about 2 mA. Given the rounding of IFawcetti (Il970l) . an rm.s. 
uncertainty of 3 mA is more appropriate. However, there is 
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some level of blending with other multiplets. Blending is prob- 
ably due to other 2p-3d transitions in this ion, most likely 
transitions be tween the ^P-^D, ^P-^P and 'D-'F multiplets 
dAggarwal & Keenan 2003) . A compariso n of the theoretical 
wavelengths of A ggarwal & KeenanI (l2003i) with the experimen- 
tal values of Kellvl (Il987h for all Unes of the ^P - ^^D multiplet 



6.4.3. Lines from foreground hot gas absorption 



shows, apart from an offset of 145 mA, a scatter of 6 mA. 

Given all this, we adopt here the original measurement of 
iFawcettI (I1971.) . but with a systematic uncertainty of 5 mA. As 
Caxv has a relatively large ionisation parameter (log^ - 2.5), 
we expect that like other highly ionised species in Mrk 509 it is 
dominated by the high-velocity outflow components, that have 
a velocity of about -340 kms"' (see Table |4|. Putting a con- 
servative 200 km s"' uncertainty on this (allowing a centroid up 
to halfway between the two main absorption troughs), we may 
expect the Hne at 23.488 ± 0.017 A. 



Centroid of tine Oi blend: The total equivalent width of the 
Oi blend is 29.1 + 2.8 mA (see Table [3]i. The velocity dis- 
persion of the contaminating O iv and Ca xv lines is an order 
of magnitude larger than the small velocity dispersion of the 
cold Galactic gas, and both contaminants have optical depth 
smaller than unity. The total blend is thus the combination of 
broad and shallow contaminants with deep and narrow O i com- 
ponents. Thus the contribution due to O i itself is obtained by 
simply subtracting the contributions from O iv and Ca xv from 
the total measured equivalent width of the blend. This yields an 
O I equivalent width of 20.6 + 4.2 mA. Taking into account the 
contribution of the contaminants, we expect the centroid of the 
blend to be at 23.5 1 15 + 0.0032 A. This corresponds to an offset 
A^ = 9.9 + 4.5 mA. 



N i: The other strong X-ray absorption line from the neutral 
phase of the ISM is the N i ls-2p line. We adopt the same LSR 
velocity as for the unblended Oi line, -h24 kms '. This is con- 
sistent with the measured line profile of the Ni /11199.55 A 
line, which show s two troughs at about and H-50 kms ' 
dCoUins et al. '2004). The shift derived from the N i line is there- 
fore H- 12.2 + 8.9 mA. 

We note, however, that for Ni we can only use RGS2, as 
RGSl shows some instrumental residuals near this line. The 
nominal wavelength for RGS 1 is 26 mA higher than for RGS2. It 
is therefore worthwhile to do a sanity check. Therefore we have 
determined the same numbers from archival data of Sco X-1. 
That source was observed several times with significant offsets in 
the dispersion direction, so that the spectral lines fall on different 
parts of the CCDs or even on different CCD chips. We measure 
wavelengths for O i and N i in Sco X-1 of 23.5123 ± 0.0015 and 
31.2897 + 0.0030 A, resp ectively. Sco X-1 has a H eliocentric 
velocity of -114 kms ' (ISteeghs & Casaresll2002l) . Given its 
Galactic longitude of (fJ, we expect the local foreground gas 
to have negligible velocity. This is confirmed by an archival 
HST/GHRS spectrum, taken from the HST/MAST archive, and 
showing a velocity consistent with zero, to within 5 kms ' for 
the N I /ll 199.55 A line from the ISM. This then leads to consis- 
tent observed offsets for the ls-2p transitions of Oi and Ni in 
Sco X-1 of 1.0 ± 2.3 mA and 4.0 + 3.0 mA, respectively. 

Combining both the Oi and the Ni line for Mrk 509, we 
obtain a shift of +10.3 ± 4.0 mA. 



Table 7. Decomposition of the Ovi profile of ISembach et al.l 
(l2003h into Gaussian components. 



Vlsr o" column 

(kms-') (kms-') (10'* m--) 



1 


-245 


59 


214 


2 


-126 


16 


23 


3 


-64 


13 


19 


4 


+26 


44 


440 


5 


+ 152 


24 


26 



Our spectrum also contains significant lines from hot, 
ionised gas along the line of sight, in particular from O vi, O vii, 
Ovm, and Cvi. S embach et al. (2003) give a plot of the Ovi 
line profile. This line shows clear high-velocity components at 
-247 and -143 kms-^ Unfortunately, no column density o f 
the main Galactic component is given bv lSembach et al.l (l2003h . 
Therefore we have fitted their profile to the sum of 5 Gaussians 
(Table |7]l. The column-density weighted centroid for the full line 
is at -56 kms-' in the LSR frame. We estimate an uncertainty 
of about 8 km s-' on this number 

Note that a part of the high-velocity trough in the blue is 
contaminated by a molecular hydrogen line at 103 1 . 19 A, which 
falls at Vlsr = -206 kms-'. This biases component 1 of our fit 
for the hot foreground galactic component by about 10 kms-'. 
This component in C iv. Si iv, and N v in the COS spectra is at a 
velocity of vlsr = -245 km s-' dKriss et al.»2011i) . and therefore 
we adopt that value here. 



Table 8. Line shifts AA = /lobs - 'ipied in mA for the X-ray ab- 
sorption lines from the hot interstellar medium. 



transition 


A/l" 


AA^ 


error^ 


error'' 




(mA) 


(mA) 


(mA) 


(mA) 


Fe XVII 2p-3d 


9 


5 


8 


13 


Nex ls-2p 


-8 


-11 


10 


13 


Neix ls-2p 


9 


6 


7 


11 


Ovii ls-3p 


-3 


-8 


10 


16 


Ovm ls-2p 





-5 


5 


14 


Ovii ls-2p 


-2 


-8 


5 


15 


Ovi ls-2p 


8 


2 


13 


13 


Cvi ls-2p 


11 


1 


13 


26 


" Using VLSR = 


-56 km 


S-' 






'' Using Vlsr = 


+26 km 


S-' 







"^ Only statistical error 

'' Including uncertainty of : 



;200 km s ' except for O vi 



While this velocity can be readily used for predicting the 
observed wavelength of the O vi ls-2p line, the uncertainty for 
other X-ray lines is larger, because the temperatures or ionisation 
states of the dominant components 1 and 4 may be different. 
Moreover, O vi is the most highly ionised ion available in the UV 
band, while most of our X-ray lines from the ISM have a higher 
degree of ionisation. Therefore, without detailed modelling of 
all these components, we conservatively assume here for all ions 
other than Ovi an error of 200 kms' on the average outflow, 
corresponding to the extreme cases that the bulk of the column 
density is in either component 1 or 5. 
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The Galactic Ovii ls-2p line at 21.6 A suffers from some 
blending by the N vii 1 s-3p line from the outflow. That last line, 
with a predicted equivalent width of 4.5 mA, has a relatively high 
ionisation parameter, log^ = 1.4. Therefore we do not know 
exactly what its predicted velocity should be, and we conserva- 
tively assume a value of -100 + 200 kms"', spanning the full 
range of outflow velocities. With that, the predicted wavelength 
of the N VII outflow line is 2 1 .624 + 0.0 1 4 A, while the predicted 
wavelength of flie O vn line is 2 1 .5993 ±0.0006 A. The predicted 
wavelength of the blend then is 21.606 + 0.004 A. 

In Tabled] we list the lines that we use and the derived line 
shifts AA. The weighted average is + 1 .7 ± 2.6 mA when we only 
include statistical errors, and +2.8 ± 4.9 mA if we include the 
systematic uncertainties. 

We make here a final remark. Combining all wavelength 
scale indicators including the one above (Sect. I6.4.6I I, we ob- 
tain an average shift of - 1 .6 mA. Using this number, the most 
significant lines of Table[8](Ovii and O viii ls-2p) show clearly 
a velocity much closer to the dominant velocity component 4 
(+26 kms"'. Table |7]l than to the column-density weighted ve- 
locity of the O VI line at -56 km s"' . This last number is signif- 
icantly affected by the high-velocity clouds, that apparently are 
less important for the more highly ionised ions. Therefore, in 
Table [8] we also list in column 3 the wavelength shifts under the 
assumption that all lines except O vi are dominated by compo- 
nent 4. With that assumption, and taking now only the statistical 
errors into account, the weighted average shift is -1.7 ± 2.8 mA. 

6.4.4. Comparison with a sample of RGS spectra of stars 

Preliminary analysis (R. Gonzalez-Riestra, private communica- 
tion) of a large sample of RGS spectra of stellar coronae indi- 
cate that there is a correlation between the wavelength offsets 
and the Solar Aspect Angle (SAA) of the satellite (correlation 
coefficient about 0.6). At a mean SAA of 90°, the average AA 
for RGSl is 2.3 mA, while for RGS2 it is 7.3 mA. The scatter 
for an individual observation is 6 and 7 mA, respectively. The 
difference between RGS 1 and RGS2 is fairly consistent with the 
6.9 ± 0.7 mA that we adopted for the present work. As our obser- 
vation campaign on Mrk 509 spans the full visibility window, the 
average SAA angle is 90°; it varied between 108?5 for the first 
observation, to 73?3 for the last observation. The wavelengths in 
our spectrum (averages of the original RGS 1 wavelengths and 
the RGS2 wavelengths minus 6.9 mA) should then be off by 
+ 1.4 ±6.5 mA. 

The above estimate can be refined further. It appears that 
there is also a correlation of the wavelength offset with the tem- 
perature parameters T2007 and T203 1 of the grating boxes on 
RGS 1 and RGS2, respectively. However, these temperatures do 
not correlate well with the SAA. Hence, following R. Gonzalez- 
Riestra et al. (private communication) and using their data on 38 
spectra of stellar coronae, we obtain the following correlations: 



Table 9. Average line shifts for the six strongest Unes, relative to 
the total spectrum. 



RGSl : Ai = 4.92 



RGS2: Ai = 8.32 



0.49(SAA - 90°) 
12.8(T2007 - 20.7), 
0.55(SAA - 90°) 
11.8(T2031-20.7), 



(18) 
(19) 



where SAA is expressed in degrees, and the temperatures in °C. 
The scatter around this relation is 4. 1 and 4.8 mA for RGS 1 and 
RGS2, respectively. This scatter is significantly smaller than the 
scatter using only the correlation with SAA mentioned above. 



Obs 


A-A^ 


SAA 


T2007 


T2031 


A-A^ 




(mA) 


(°) 


(°C) 


(°C) 


(mA) 


1 


-6.2 ±3.0 


108.5 


21.38 


21.48 


-10.60 


2 


-12.2 + 2.9 


104.1 


21.26 


21.38 


-6.97 


3 


+3.9+1.9 


100.7 


21.32 


21.42 


-5.80 


4 


+2.4 + 2.2 


94.7 


21.24 


21.33 


-1.68 


5 


-3.6 + 2.1 


90.9 


21.43 


21.51 


-1.98 


6 


-3.0 + 2.1 


86.7 


21.51 


21.54 


-0.48 


7 


0.0+2.3 


82.7 


21.28 


21.40 


3.90 


8 


+7.3 + 2.3 


78.7 


21.32 


21.43 


5.52 


9 


+7.6 + 2.1 


75.0 


21.24 


21.35 


8.42 


10 


+0.3 + 2.3 


72.7 


21.24 


21.34 


9.68 


alF 


= 


89.5 


21.32 


21.42 


= 



Observed wavelength difference for individual observations. 
Predicted wavelength difference for individual observations using 
(I18ll-l|19l); results for RGSl and RGS2 have been weighted aver- 
aged (not all lines present in each RGS). 
Combined spectrum. 



o 


' 


' 


1 








1 


1 1 


1 


< - 




+ 


( 1 






-f- 






■-+ : 


E ° 








r 


+ 


A 
V 

1 


I 


t 




+ 


-< o 


_ 


















: 




1 


1 
















Residuals (mA 
-10 10 






1 


♦ 


1 


+ 


1 


* 


1 








> 


4 6 
Observation number 


8 




10 



Fig. 10. Top panel: wavelength difference for the six strongest 
lines in each observation relative to the lines in the average spec- 
trum (data points) and model based on Eq. (fT8b-[T9l(histogram). 
Bottom panel: residuals of the data points relative to the model 
in the upper panel. 



We apply this now to the 10 individual observations of 
Mrk 509. We have determined the individual line centroids of 
the 6 strongest lines (C vi 1 s-2p, O vni 1 s-2p and 1 s-3p, O vii 
ls-2p, Ne IX ls-2p all from the outflow, and O i ls-2p from the 
foreground). We then take the difference with the wavelength 
in the combined observation, and averaged these residuals. We 
show the results in Tableland Fig.fTOl 

The model gives an improvement: the rm.s. residuals (af- 
ter correction for the mean statistical errors on the data points 
of 2.3 mA) are reduced from 5.4 to 4.6 mA. This is fully con- 
sistent with the remaining residuals in the set of coronal spec- 
tra discussed above. It is also clear that the remaining residuals 
are non-statistical, and although on average the model gives an 
improvement, in some individual cases the opposite is true, for 
instance for observations 3, 7 and 10. 
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Keeping this in mind, we can use the model to predict the 
absolute wavelength scale. We predict absolute offsets AA of 
-2.76 and +0.10 mA for RGSl and RGS2, respectively. In our 
extraction procedure, we have subtracted 6.9 mA from the RGS2 
wavelengths; doing the same for the predicted RGS offset, and 
averaging the offsets for RGS 1 and RGS2, the model predicts a 
total offset of -4.8 mA. We only need to apply a small correc- 
tion for the velocity of the Earth. It can be assumed that the sam- 
ple of stars has, on average, zero redshift due to the motion of 
Earth, because usually each source can be observed twice a year 
with opposite signs of the Doppler shift. Our data of Mrk 509 
were taken with a rather extreme Earth velocity of -29.3 km s"' . 
At a typical wavelength of 20 A, this corresponds to a correc- 
tion factor of 2.0 mA. We finally obtain a predicted offset of 
-2.8 + 4.5 mA where the error is the average systematic resid- 
ual found for the sample of coronae mentioned before. 

6.4.5. Comparison with Cliandra LETGS spectra 



Table 11. Overview of wavelength scale indicators. 



Indicator 



AA = Aahs - 'Ipred 

(mA) 



Lines from tlie outilow of Mrk 509 
Lines from foreground neutral gas" 
Lines from foreground neutral gas'' 
Lines from foreground hot gas absorptiotf 
Lines from foreground hot gas absorption'' 
Comparison with a sample of RGS spectra of stars 
Comparison with Chandra LETGS spectra 



-5.3: 
+ 10.3: 
+6.5: 
+2.8: 
-1.9: 
-2.8: 
-7.3: 



3.6 
4.0 

:3.4 

4.9 
4.9 

:4.5 

3.8 



All indicators combined"'-' 
All indicators combined'''^ 
All indicators combined"'' 
All indicators combined'''' 



-1.0: 
-1.2: 
-1.6: 
-1.8: 



1.8 
1.8 

1.8 
1.8 



" As described in Sect. 16.4.21 

*" Ignoring the blending by Ca xv of the O i line 

'^ Using the O vi centroid velocity 

'' Using the velocity of dominant component 4 in O vi 



Table 10. Comparison of measured wavelengths from the RGS 
and LETGS. 



transition 


RGS 


LETGS 


RGS - LETGS 




(A) 


(A) 


(mA) 


Ovin ls-3p 


16.5460 + 0.0023 


16.552 + 0.004 


-6 + 5 


Ovii ls-3p 


19.2649 + 0.0029 


19.269 + 0.007 


-4 + 8 


Oviii ls-2p 


19.6146 ± 0.0012 


19.619 + 0.004 


-4 + 4 


Ovii ls-2p 


22.3354 + 0.0018 


22.346 + 0.004 


-11±4 



We have also measured the wavelengths of the ls-2p and 
1 s-3p lines of O vii and O viii in the Chandra LETGS spectrum 
of Mrk 509 taken 20 days after our last XMM-Newton observa- 
tion. LETGS has the advantage above RGS that it measures both 
positive and negative spectral orders, hence allow to determine 
the zero-point of the wavelength scale more accurately. TablefTOl 
lists the wavelengths measured by both instruments. The average 
wavelength offset of RGS relative to LETGS is -6.9 ±2.4 mA. 
The above number only includes the statistical uncertainties. We 
have extracted an LETGS spectrum of Capella, taken around 
the same time as the Mrk 509 observation. The ten strongest 
lines in the spectrum of Capella between 8-34 A show a scatter 
with r.m.s. amplitude of 6 mA compared to the laboratory wave- 
lengths, but no systematic offset larger than 2 mA. We therefore 
add a systematic uncertainty of 6/ Vi - 3 mA in quadrature to 
the statistical uncertainty of the average AA for the four lines in 
Mrk 509 that we use here. 

We only need to make a small correction of -0.4 mA cor- 
responding to the higher speed of Earth away from Mrk 509 
(-29.3 kms"') during the XMM-Newton observation as com- 
pared to the Chandra observation (-23.3 kms '). This all leads 
to an offset of -7.3 ±3.8 mA. 



6.4.6. Summary of results on tine wavelengtii scale 

We now summarise our findings from the previous sections 
(Table fTTI). All estimates agree within the error bars, except for 
the lines from the neutral foreground gas, which is off by about 
3cr. This number is dominated by the rather complex O i blend. 
Possibly there are larger systematic uncertainties on this num- 
ber than we assumed. The major factor here is the blending with 



Ca XV. If we assume that the equivalent width of the contaminat- 
ing Ca XV line is zero, the predicted blend centroid decreases by 
4.4 mA, towards better agreement with the other indicators. The 
O IV blend has a much smaller impact, as its centroid is already 
close to the centroid of the Galactic O i line. 

We note that a smaller value of the predicted equivalent 
width of the Ca xv line may be caused by a somewhat lower 
Ca abundance; alternatively, if the density is higher than about 
lO'* m"^, th e population of Ca xv in t he ground state decreases 
significantly ("Bhatia & Doschek' l993h . leading also to a reduced 
equivalent width of this absorption line. 

All five methods used here have their pros and cons, and we 
have done the best we can to estimate the uncertainties for each 
of the methods. The five methods are for the major part statis- 
tically independent, and this allows us to use a weighted aver- 
age for the final wavelength correction. Ignoring the blending 
by Caxv to the Galactic Oi line, and using +26 kms ' for the 
outflow velocity of the hot Galactic gas, this wavelength shift is 
-1.8 ± 1.8 mA. Within their statistical uncertainties, all individ- 
ual indicators are consistent with this value, except for the lines 
from foreground neutral gas, which are off by 8.3 ± 3.8 mA. We 
do not have an explanation for that deviation. 

In our analysis, we apply the wavelength coiTections by 
shifting the wavelength grids in our program rgsJmat. 

7. AGN outflow dynamics 

To show how the optimised wavelength calibration improves the 
scientific results, we give here a brief analysis of the velocity 
structure of the AGN outflow as measured in X-rays through two 
Rydberg s eries of oxygen ions. A full analysis of the spectiTim 
is given bv lDetmers et al.l (|20II|) . 

Using the measured outflow velocities for individual lines, 
we can immediately derive some interesting astrophysical con- 
clusions (Fig.fTTTi. 

Most of the lines are in between the high-velocity compo- 
nents 1-3 and the lower veloci ty components 4-7 as found in 
the Ovi lines dKriss et al.ll2000h . This shows that there are mul- 
tiple velocity components present in the X-ray absorption lines. 
At longer wavelengths these lines are closer to the low velocity 
components, showing that there is more column density in those 
components. However, for the shortest wavelength lines, in par- 
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Fig. 11. Average outflow velocity as measured through individ- 
ual lines with statistical errors smaller than 100 kms '. The 
dashed lines show the 7 velocity components as seen in archival 
FUSE data (.Kriss et al..,200Q) . 




Fig. 12. Equivalent width and line centroid for the ls-2p to Is- 
5p lines of O viii, plotted versus principal quantum number n of 
the upper level. The predicted values for model 1 are shown as 
dashed lines, those for model 2 as solid lines. 



ticular the 1 s-2p transitions of Ne ix and Ne x, the centroid is 
in the range of the high- velocity components 1-3. As the shorter 
wavelength lines originate from ions with a higher ionisation pa- 
rameter, there is a correlation between the ionisation parameter 
and the outflow velocity. 

The relative deviation of the 17.8 A line (O vii Is^p) with 
respect to the other lines, including the ls-2p and ls-3p lines 
of the same ion, can be easily explained: according to prelim- 
inary modelling, the optical depth at line centre for the ls-2p, 
ls-3p and ls-4p lines of Ovii is 8, 1.4 and 0.5, respectively. 
Thus, if the O vii column density of the low velocity component 
is larger than that of the high-velocity component, the strongest 
low-velocity line components (like ls-2p, ls-3p) will saturate 
while the high-velocity components of these transitions will not 
saturate, hence eff'ectively these strong lines will be blueshifted 
relative to the weaker Is^p line, consistent with what is ob- 
served. 

We have elaborated on the line diagnostics for two of the 
most important ions in the outflow of Mrk 509 to see if we can 
detect both velocity components: O vm and O vn. 



shown in Table [T2l and Fig. [12] Both models give a good fit and 
cannot be distinguished on statistical grounds. The total column 
densities for both models are the same within their error bars, 
and also the sum of the width of the Gaussians for Model 2 is 
consistent with the width of the single Gaussian for Model 1 . 
Apparently, as far as the determination of total line width and 
column density is concerned, the detailed column density distri- 
bution as a function of velocity is not very important. This can be 
understood as the strongest line with the best statistics (ls-2p) 
has a large optical depth (12.7 and 4.9 for the two components in 
model 2). The line core is thus black, the line shape rather "rect- 
angular" and to lowest order approximation the width of the line 
is given by the sum parts of the spectrum where the core is black. 
For Model 2, the total width for each of the components 
agrees well with the total width of components 1-3 and 4-7 of 
the O VI line. Thus, Oviii originates from multiple components, 
but the fastest outflow components have a larger column density 
than the slower outflow components. 



Table 12. Parameters of the velocity components for O vm in the 
outflow of Mrk 509. Numbers in parenthesis were kept fixed. 



Parameter 


Model 1 


Model 2 






comp. 1- 


3 comp. 4—7 


v(kms-') 


-115±17 


(-329) 


(+46) 


(T (kms"') 


113±7 


45 ±8 


71 ±15 


W.^dO^Om-^) 


10.7 ±1.3 


6.9 ± 2.6 


4.3 ±1.5 






total: 


11.2±2.0 


X 


4.21 




3.01 


d.o.f. 


5 




4 



For O vin we have measured velocities and equivalent widths 
for the ls-2p to ls-5p lines (Table |3]l. We have tried different 
models. The first model consists of a single Gaussian absorption 
line, with free centroid, width and column density. Model 2 has 
two Gaussian components, with the centroids frozen to the val- 
ues given in Table |4] as derived from the O vi troughs, but with 
the widths and column densities free parameters. The results are 



Table 13. Parameters of the velocity components for O vii in the 
outflow of Mrk 509. Numbers in parenthesis were kept fixed. 



Parameter 


Model 1 


Model / 








comp. 1-3 


comp. 4-7 


v(kms-') 
a (kms"') 
iV.on (10^" m-2) 


-102 ±17 

96 ±9 
5.9 ±0.7 


(-329) 

80(> 60) 

0.7(-0.1,+0.6) 

total: 7.8 ± 


(+46) 
51±8 

6.9 ± 1.6 

1.4 


X 
d.o.f. 


22.57 
5 


12.93 
4 





We have done a similar analysis for the 1 s-2p to 1 s-5p lines 
of Ovii (Table [T3Ti. The velocity broadening is consistent with 
what we obtained for O vm, although there is somewhat larger 
uncertainty. For O vii, however, the low velocity components 4- 
7 have clearly the highest colu mn density, similar to what was 
found for O vi dKriss et al.1120001) . 
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Locally, if the spectrum on the ;8-grid is given by a set of random 
variables Xj and the spectrum on the wavelength grid is given by 
y,, then we have 



20 
Wavelength (A) 

Fig. 13. Relative contribution per wavelength bin to the reduced 
X^- Calculated and shown is the ratio Var[F,]/E[F,] (see text). 
The thick line is a low-resolution spline approximation showing 
the trends in the average value. 



8. Statistics 

Local continuum fits to RGS spectra in spectral bands that have 
few lines yield in general ;t'^ values that are too low. This is an ar- 
tifact of the S AS procedures related to the rebinning of the data. 
Data have to be binned from the detector pixel grid to the fixed 
wavelength grid that we use in our analysis. However, the bin 
boundaries of both grids do not match. As a consequence of this 
process, the natural Poissonian fluctuations on the spectrum as a 
function of detector pixel are distributed over the wavelength bin 
coinciding most with the detector pixel and its neighbours. In ad- 
dition, there is a small smoothing effect caused by pointing fluc- 
tuations of the satellite. Due to this tempering of the Poissonian 
fluctuations, x^ values will be lower for a "perfect" spectral fit. 

We have quantified the efifect by fitting a linear model F^ - 
a + bA to fluxed spectra in relatively line-poor spectral regions, in 
1 A wide bins in the 7-10, 17-18 and 35-38 A ranges. The me- 
dian reduced x^ is 0.72, with a 67 % confidence range between 
0.65 and 0.79. Over the full 7-38 A range, we have at least 400 
counts (and up to 3000 counts) in each bin of ~0.02 A width. 
Therefore, the Poissonian distribution is well approximated by 
the Gaussian distribution, and the usage of chi-squared statistics 
is well justified. 

The rebinning process conserves the number of counts, 
hence the nominal error bars (square root of the number of 
counts) are properly determined. The lower reduced ;t^^ is caused 
by the smoothing effect on the data. For correct inferences about 
the spectrum, such a bias in;^'^ is not appropriate. As we cannot 
change the observed flux values, we opt to multiply the nominal 
errors on the fluxes by VO.72 = 0.85, in order to get acceptable 
values for ideal data and models. 

As a final check we have made a more robust estimate of the 
effect. For RGSl, the wavelength channel number c,i as a func- 
tion of dispersion angle channel c/j can be approximated well 
with 



O = 5.3708506 + 0.66942769c« + 1.12824091 x IQ-\1. (20) 



Yi-Y,f'J^J' 



(21) 



for a small number of bins j around the corresponding yS-channel 
given by ( |20] |. It is here important to note that while the Xj vari- 
ables are statistically independent, the F, variables are in general 
not statistically independent. Typically, the fij factors are pro- 
portional to the overlap of wavelength bin j with y6-bin /. The 
clue is now that from elementary statistical considerations we 
have for the expected value of Yj and the variance 



j 
Var[y,]=^./;5Var[X,]. 



(22) 



(23) 



For instance, when the single y6-bins j would be exactly redis- 
tributed over two adjacent /l-bins, then there are two /-values, 
each = 0.5, and we have Var[y,]/E[y,] = 0.5War[Xj]/E[Xjl 
where the factor 0.5 comes from 0.5^ + 0.5^. We have done 
the calculation exactly using the grids related through ( l20l i. and 
show our result in Fig. [13] 



9. Conclusion 

9.1. Wavelength scale and spectral resolution 

By using five different methods, we have obtained a consistent 
wavelength scale with an accuracy of about 1.8 mA, four times 
better than the nominal wavelength accuracy of RGS. The only 
slightly deviant diagnostic remains the neutral gas in our galaxy 
(Oi andN i), which are off from the mean by 8.8+3.8 mA (2.3cr). 
The systematics due to blending (for O i) and the lack of suffi- 
ciently reliable RGS data for N i may be slightly larger than we 
have estimated. 

Our work shows that in cases Uke ours a dedicated effort to 
improve the wavelength scale is rewarding. As we showed, at 
this level of accuracy, Doppler shifts due to the orbit of Earth 
around the Sun become measurable. It also allows us to use X- 
ray lines as tools to study the dynamics of the outflow and the 
Galactic foreground at a level beyond the nominal resolution of 
the instrument (see Sect.|7]l. 

9.2. Effective area corrections 

The corrections that we have derived for the relative effective 
area of both RGS detectors, as well as for the relative effec- 
tive area of both spectral orders are at the few percent level but 
should be taken into account for the high statistical quality spec- 
trum that we consider here. This is due to the fact that there are 
data gaps for each individual RGS, but each time at a different 
wavelength range. Without such a correction, there would be ar- 
tificial jumps at the boundaries of these regions of the order of 
a few percent. This leads any spectral fitting package to attempt 
to compensate for this by enhancing or decreasing astrophysical 
absorption lines or edges, in the hunt for the lowest ;t'^ value. 

A disadvantage of our method is of course that the "true" 
effective area and hence absolute flux is known only to within 
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a few percent, and the deviati ons are wavelength-d ependent. In 
our subsequent spectral fitting (iDetmers et al.l201 ih we compen- 
sate for this by using a spline for the AGN continuum. This is 
clearly justified, as one might worry about the intrinsic appli- 
cability of e.g. pure power-law or blackbody components when 
the flux can be measured down to the percent level accuracy. The 
"true" continua may contain all kinds of features including e.g. 
relativistically broadened lines (e.g. Branduardi-Raymont et al. 
l2001l:[Sako et al. 2003) that can be discerned only in the highest- 
accuracy data, such as the present one. 

9.3. Combination of spectra 

The reduction of the combined response matrices of our 40 spec- 
tra with 2 Gb of memory into a single spectrum and response file 
of only 8 Mb, a reduction by more than two orders of magnitude, 
allows us to ma ke complex spectral fi ts. In the most sophisticated 
models used bv lDetmers et al.l (1201 lb the number of free param- 
eters approaches 100, and without this eflicient response matrix 
the analysis would have been impossible. 

9.4. Final remarks 

Due to the high statistical quality of our data, the binning prob- 
lems mentioned in Sect.[8]became apparent. In many RGS spec- 
tra that have been obtained during the lifetime of XMM-Newton, 
the statistical quality of the spectra is not good enough to detect 
this effect. Still, this could imply that when users obtain a fit 
with reduced chi-squared of order unity, the fit is formally not 
acceptable because an ideal fit would yield a smaller reduced 
chi-squared. Clearly, here more work needs to be done to alle- 
viate the problem, but due to the discretisation of wavelength in 
the detector corresponding to the natural size of the CCD pix- 
els there is some natural limitation to what can be done. Even 
in the best observations there is always some small scatter in the 
satellite pointing, making some rebinning necessary. 

Overall, however, the calibration of RGS is at a fairly ad- 
vanced stage. Only for exceptionally good spectra like the 
Mrk 509 spectrum that we discussed here the sophisticated tools 
developed in this paper are needed, although also for weaker 
sources our improvements can be beneficial for the analysis. Our 
tools are publicly available as auxiliary programs (rgsfluxcom- 
bine, rgsJmat) in the SPEX distribution. 
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